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Abstract 

Background: Many biomedical applications require the expression or production of therapeutic hetero-multimeric 
proteins/protein complexes: in most cases only accomplished by co-ordinated co-expression within the same cell. 
Foot-and-mouth disease virus 2A (F2A) and '2A-like' sequences are now widely used for this purpose. Since 2A 
mediates a co-translational 'cleavage' at its own C-terminus, sequences encoding multiple proteins (linked via 2As) 
can be concatenated into a single ORF: a single transgene. It has been shown that in some cases, however, the 
cleavage efficiency of shorter versions of F2A may be inhibited by the C-terminus of certain gene sequences 
immediately upstream of F2A. This paper describes further work to optimise F2A for co-expression strategies. 

Results: We have inserted F2A of various lengths in between GFP and CherryFP 'reporter' proteins (in reciprocal or 
tandem arrangements). The co-expression of these proteins and cleavage efficiencies of F2As of various lengths 
were studied by in vitro coupled transcription and translation in rabbit reticulocyte lysates, western blotting of HeLa 
cell lysates and fluorescence microscopy. 

Conclusions: Optimal and suboptimal lengths of F2A sequences were identified as a result of detailed 'fine-tuning' 
of the F2A sequence. Based on our data and the model according to which 2A activity is a product of its 
interaction with the exit tunnel of the ribosome, we suggest the length of the F2A sequence which is not 'sensitive' 
to the C-terminus of the upstream protein that can be successfully used for co-expression of two proteins for 
biomedical applications. 

Keywords: Protein co-expression, Bicistronic vector, Co-translational 'self-cleavage' 



Background 

Many biomedical applications rely on the ability to co- 
express multiple proteins within the same cell These 
may include co-expression of genes encoding subunits 
of hetero-multimeric therapeutic proteins (e.g. im- 
munoglobulins, cell receptors, interleukins, enzymes, 
transcription factors), multiple therapeutic genes for a 
combined and/or synergistic effects (e.g. immune- 
stimulatory), and co-expression of therapeutic genes 
plus required co-factor or marker gene (e.g. drug re- 
sistance, colour, fluorescence) for detection or selec- 
tion of transduced cells (reviewed by de Felipe [1] and 
Pfutzner [2]). 



* Correspondence: minskayakat@hotmail.com 

Biomedical Sciences Research Complex, North Haugh, University of St. 
Andrews, St.Andrews, KY16 9ST Fife, Scotland, UK 



Various conventional approaches for protein co- 
expression have been utilized such as the use of multiple 
internal promoters, internal ribosome entry sites (IRESes), 
messenger RNA splicing, fusion proteins, and post- 
translational proteolysis. There are, however, disadvan- 
tages associated with each of these approaches. For ex- 
ample, steric effects often lead to loss of function of fused 
proteins. In order to avoid such problems, proteinase 
cleavage sites can be incorporated. In this case both the 
polyprotein substrate and the processing enzyme must, 
however, be co-expressed in the same subcellular site. 
When multiple promoters have been employed, interfer- 
ence has occurred between promoters leading to promoter 
suppression and rearrangement. IRES elements, identified 
both in viral and cellular eukaryotic mRNAs, differ in 
length (from 130 bp to 1 kb). The most efficient viral 
IRESes successfully utilized in vectors used for biomedical 
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purposes are about 500 bp. However, their size can cause 
problems in packaging within size-restricted vectors com- 
monly used for biomedical applications such as adeno- 
associated and retroviral vectors. Furthermore, expression 
from IRESes is dependent on various cellular binding 
factors which vary amongst different cell types. Critically, 
the protein downstream of the IRES is only expressed 
to -10% of that upstream [1-7]. 

Picornavirus (foot-and-mouth disease virus; FMDV) 
F2A and '2A-like' sequences are now widely used for co- 
expression of multiple genes. The 2A region of the FMDV 
encodes a sequence that mediates self-processing by a 
translational effect variously referred to as ribosome skip- 
pingl stop-go' and stop-carry on translation [8,9]. Analysis 
of recombinant polyproteins and artificial polyprotein sys- 
tems in which 2A was inserted between two reporter 
proteins showed that the FMDV 2A oligopeptide (plus the 
N-terminal proline of the 2B downstream protein) co- 
translationally self-cleaved' at the glycyl-prolyl pair site 
corresponding to the 2A/2B junction (-LLNFDLLKLA- 
GDVESNPG j P-). The use of longer versions of 2A with 
N-terminal extensions derived from FMDV capsid protein 
ID upstream of 2 A (~30aa in total) was reported to pro- 
duce higher levels of cleavage [10-13]. 

For simplicity, the co-translational ribosome skipping 
event will be referred to as cleavage'. The merits of this 
system are: (i) co-expression of proteins linked by 2A is 
independent of the cell type (since cleavage activity is 
only dependent on eukaryotic ribosomes, structurally 
highly conserved amongst the eukaryota), (ii) multiple 
proteins are co-expressed in equimolar amounts from a 
single transcript mRNA (single ORF) under the control 
of only one promoter and, (iii) 2 A is smaller (54-174 bp) 
compared to IRES elements. This makes this unique se- 
quence an attractive substitute for previously used ap- 
proaches for co-expression of multiple genes [10,14-16]. 
However, it should be noted that (i) 2A remains as a C- 
terminal extension of the upstream product, and (ii) pro- 
line forms the N terminus of the downstream protein. 

To date, various reporter proteins together with pro- 
teins requiring discrete co- and post-translational sub- 
cellular localization have been successfully co-expressed 
in in vitro and in vivo heterologous systems using F2A 
sequences of various lengths [11,17-24]. However, some 
limitations of using this strategy were reported. Firstly, 
whilst the longest F2A sequence (58aa) tested to date 
was shown to produce the most efficient cleavage, the 
C-terminal F2A extension of the upstream translation 
product may have an effect on protein confirmation and 
activity - such as in the production of monoclonal anti- 
bodies or expression of enzymes. To minimise this ef- 
fect, a number of laboratories have used shorter versions 
of F2As [11,14,19,22,25], or, incorporated a furin cleav- 
age site between the C-terminus of the upstream protein 



and N-terminus of the 2A sequence such that the C- 
terminal extension is 'trimmed' away. This approach can 
only be used for secreted proteins, however, since furin 
is primarily localised within the Golgi apparatus [22,26]. 

It has been reported, however, that the efficiency of 
F2A cleavage may be greatly inhibited in certain cases 
where the C-terminal sequences of the upstream gene 
were shown to lower F2A activity [27,28]. It was pro- 
posed that such inhibition could be overcome by the 
use of different lengths of the sequence between the C- 
terminal region and F2A [11]. To further investigate 
F2A cleavage efficiency and to optimise the 2A system 
for the coordinate expression of two genes from a 
bicistronic vector, we generated constructs encoding 
two fluorescent 'reporter' proteins, GFP and CherryFP, 
in different positions linked by F2A sequences of vari- 
ous lengths. The co-expression of these cytoplasmic 
proteins and the comparative cleavage efficiencies were 
determined both in vivo in HeLa cells, a human cell line 
most commonly used in biomedical applications (west- 
ern blotting/fluorescence microscopy), and in vitro 
(coupled transcription/translation system). 

Results 

Co-expression of GFP and CherryFP fluorescent proteins 
from pGFP-F2A-CherryFP constructs 

A series of constructs were produced encoding GFP and 
CherryFP fluorescent reporter' proteins under the control 
of both CMV and T7 promoters. Sequences encoding 
GFP and CherryFP were separated by either the longest 
F2A sequence (F2A 58 ) or N-terminally truncated versions 
of various lengths (F2A 50 , F2 40 , F2A 30 , F2A 2 o> and F2A 18 ; 
Figure 1), to establish the optimal length of F2A sequence 
for co-expression of two proteins and their relative clea- 
vage efficiencies. The expected translation products were 
cleaved [GFP-F2A] and CherryFP, and the uncleaved 
polyprotein [GFP-F2A-CherryFP] (Figure 2a). 

For the in vitro translation studies, rabbit reticulocyte 
lysates were programmed with the pGFP-F2A-CherryFP 
series of constructs and the translation products re- 
solved by 12% SDS-PAGE (Figure 2b). As expected, two 
major translation products (of the expected molecular 
weights) were observed. In each track the upper slower 
migrating product corresponded to either [GFP-F2A 58 ] 
(34.5 kDa), [GFP-F2A 50 ] (33.5 kDa), [GFP-F2A4 0 ] (32.3 kDa) 
or [GFP-F2A 30 ] (31.2 kDa). The second product corres- 
ponded to the CherryFP cleavage product. Translation 
profiles derived from pGFP-F2A-CherryFP constructs 
also showed a low level of internal initiation which pro- 
duced products of lower molecular weights. The bands, 
based on the gradual decrease in molecular weight, 
were comparable to the [GFP-F2A] cleavage products 
and correspond to products produced by leaky scan- 
ning, initiating within GFP at Met88, in favourable 
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capsid protein ID <- T -> 2A ^ 

VTELLYRMKRAETYCPRPLLAIHPTEARHKQKIVAPVKQ TLNFDLLKLAGDVESNPG P 58aa 
KRAETYCPRPLLAIHPTEARHKQKIVAPVKQ TLNFDLLKLAGDVESNPG P 50aa 
LLAIHPTEARHKQKIVAPVKQ TLNFDLLKLAGDVESNPG P 4 0aa 
HKQKIVAPVKQ TLNFDLLKLAGDVESNPG P 30aa 
VAPVKQ TLNFDLLKLAGDVESNPG P 2 5aa 
APVKQ TLNFDLLKLAGDVESNPG P 24aa 
PVKQ TLNFDLLKLAGDVESNPG P 2 3aa 
VKQ TLNFDLLKLAGDVESNPG P 22aa 
KQ TLNFDLLKLAGDVESNPG P 21aa 
Q TLNFDLLKLAGDVESNPG P 2 0aa 
LNFDLLKLAGDVESNPG P 18aa 
Figure 1 F2A sequences used for co-expression of fluorescent 'reporter 7 proteins. 2A mediates the co-translational Yibosome skipping' 
event whilst fused to the upstream capsid protein 1D. Subsequently, 2A is co-translationally cleaved from 1D by an FMD virus proteinase (3C pro ) 
which (artificially) defines the length of 2A: the functional entity is, however, longer than this 18aa tract. The 3C pro cleavage is indicated by the 
solid vertical arrow, the site of the 2A-mediated ribosome skipping event is shown by the open vertical arrow. Versions of various lengths were 
used by producing truncations of the 58aa version. 



Kozak context (cCAUGc, indicated as 'GFP internal ini- 
tiation' in Figure 2b). 

HeLa cells were transfected with this panel of con- 
structs and cell lysates analysed 30 h post transfection. 
Similarly, the two major cleavage products were detected 
by western blotting either with an anti-GFP antibody 
showing [GFP-F2A]s of decreasing size (Figure 2c, upper 
blot), or, an anti-CherryFP antibody (Figure 2c, lower 
blot). Uncleaved polyproteins were also detected, their 
accumulation increasing as the length of F2A decreased, 
being highest for the pGFP-F2A 30 -CherryFP construct. 
These data, together with our in vitro analysis, demon- 
strated that the cleavage efficiency is slightly reduced 
when a shorter version (F2A 30 ) is used, compared to that 
observed for the 58aa version. 

Fluorescence imaging was used to directly visualise co- 
expression of fluorescent proteins. To demonstrate that 
GFP and CherryFP proteins linked via F2A were co- 
expressed within the same cell, we transfected HeLa 
cells with the panel of pGFP-F2A-CherryFP constructs 
(F2A 58 , F2A 50 , F2A 40 and F2A 30 ). In each case 
transfected cells efficiently co-expressed both GFP and 
CherryFP for up to 48 h post-transfection (Figure 2d; 
the images presented for pGFP-F2A 3(r CherryFP were 
representative of all constructs), with both proteins 
localised to the cytoplasm of transfected cells (Figure 2e). 
We could not detect any effect of increasing the length 
of the C-terminal extension of GFP by F2A (30-58aa) 
upon fluorescence. 

In contrast to our constructs with F2A of 58-30 amino 
acids, translation directed by plasmids pGFP-F2A 20 - 
CherryFP and pGFP-F2A 18 -CherryFP showed three major 
translation products (Figure 2b). In each gel track, the 
most slowly migrating band corresponded to the un- 
cleaved polyprotein forms [GFP-F2A 20 -CherryFP] and 
[GFP-F2A 18 -CherryFP] of the expected molecular weights 



(-57 kDa). The bands corresponding to the two cleavage 
products, [GFP-F2A 20 ] or [GFP-F2A 18 ] and CherryFP 
could not be resolved due to the similar molecular 
weights. In addition, three extra minor products were ob- 
served in each of these translation reactions. Internal initi- 
ation within the upstream GFP gene (probably Met88) 
produced both the fastest migrating cleaved product, cor- 
responding to protein of -20 kDa which was almost un- 
detectable on the gel (indicated in Figure 2b as 'GFP 
internal initiation') and the second slow migrating 
uncleaved product of -50 kDa (highlighted with asterisk). 
The third slower migrating minor product corresponded 
to a protein of unknown identity (shown with white 
arrow), although we suspect that this represents a transla- 
tion product with a C-terminal adduct of tRNA. 

Our in vivo studies were consistent with the in vitro 
data: the western blotting data demonstrating that the 
major products were the uncleaved [GFP-F2A 20 -Cherry 
FP] and [GFP-F2A 18 -CherryFP] polyproteins (Figure 2c). 
Small amounts of the CherryFP cleavage product, com- 
pared to F2A 58 - 30 -based constructs, were still detectable 
with anti-CherryFP antibody (Figure 2c, lower blot), the 
upstream cleavage products were, however, either al- 
most undetectable [GFP-F2A 20 ] or absent [GFP-F2A 18 ] 
(Figure 2c, upper blot). It is important to note that the 
amounts of proteins observed in these blots is a function 
of detection using two quite different antibodies, raised 
against either GFP or CherryFP and they have different af- 
finities for their antigens. Interestingly, extracts of HeLa 
cells transfected with plasmids pGFP-F2A 20 -CherryFP and 
pGFP-F2A 18 -CherryFP also contained an unexpected pro- 
tein with an estimated size of -45 kDa (shown with white 
arrow), similar to that detected in reticulocyte lysates, 
which reacted with the anti-GFP, but not anti-CherryFP 
antibodies. The nature of this protein, which could be 
generated by a leaky scanning mechanism, or, be a product 
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Figure 2 Efficiency of F2A cleavage in GFP-F2A-CherryFP context. F2A sequences of various lengths were used to co-express GFP and 
CherryFP proteins from pGFP-F2A-CherryFP constructs (a, schematic presentation) in vitro using TnT coupled transcription/ translation rabbit 
reticulocyte lysates (b) and transfected HeLa cells (c). For TnT, reticulocyte lysates were programmed with 20 ng of plasmid DNA and translation 
products were resolved by the 12% SDS-PAGE. For in vivo studies, HeLa cells were transfected with 1.5 ug of plasmid DNA and harvested 30 h 
post transfection. Cells were lysed in RIPA buffer and equal amounts of total protein for each transfection were loaded onto 12% SDS-PAGE gel. 
The proteins were transferred onto a nitrocellulose membrane, blocked in PBST containing 5% milk and probed with anti-GFP (upper blot) and 
anti-CherryFP (lower blot) antibodies overnight at 4°C. Detection of bound primary antibody was achieved by using respective secondary 
antibodies, followed by ECL detection, (d) Expression of fluorescent proteins was detected in transfected HeLa cells at 24 h and 48 h post 
transfection in Evos microscope using 4 x objective, (e) Cellular co-localisation of [GFP-F2A] and CherryFP 30 h post-transfection. HeLa cells pre- 
plated on cover slips were transfected with 0.5 ug of plasmid DNA and incubated for 30 h, fixed with 100% methanol and mounted using 
VECTASHIELD mounting medium with DAPI. The images were acquired with Deltavision microscope using 1 00 x objective using Resolve 3D 
software. All experiments were done in triplicates. 



of degradation of the uncleaved precursor is under further 
investigation. 

Interestingly, in the case of constructs which produced 
a high level of cleavage, the fluorescent signals from 
both GFP and CherryFP were distributed throughout the 
cytoplasm, in the case of constructs which produced low 
levels of cleavage, the fluorescent signals were accumu- 
lated mostly around the nucleus (Figure 2e). 

Tandem repeated cherry fluorescent proteins 

To determine if the C-terminal region of the protein up- 
stream of 2A affected the cleavage activity, we substituted 



the upstream gene (GFP) with that of CherryFP (Figure 3a). 
Again, these sequences were linked by either F2A 58 , F2A 50 , 
F2A 40 , F2A 30 , F2A 20 or F2A 18 . Similar to our previous re- 
sults, when reticulocyte lysates were programmed with 
constructs encoding F2As of diminishing lengths, the 
uncleaved polyprotein [CherryFP-F2A-CherryFP] formed a 
very minor translation product for F2A of 58 to 30aa 
(Figure 3b). In the case of these F2As, a doublet was ob- 
served corresponding to the [ CherryFP- F2 A] cleavage 
product. The fastest migrating product corresponded to the 
cleavage product downstream of F2A, CherryFP. The trans- 
lation profiles derived from all pCherryFP-F2A-CherryFP 
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Figure 3 Efficiency of F2A cleavage in CherryFP-F2A-CherryFP context. F2A sequences of various lengths were used to co-express two 
CherryFP proteins from pCherryFP-F2A-CherryFP constructs (a, schematic presentation) in vitro using coupled transcription/ translation rabbit 
reticulocyte lysates (b) and transfected HeLa cells (c). For TnT, reticulocyte lysates were programmed with 20 ng of plasmid DNA and translation 
products were resolved by the 12% SDS-PAGE. For in vivo studies, HeLa cells were transfected with 1.5 ug of plasmid DNA and harvested 30 h 
post transfection. Cells were lysed in RIPA buffer and equal amounts of total protein for each transfection were loaded onto 12% SDS-PAGE gel. 
The proteins were transferred onto a nitrocellulose membrane, blocked in PBST containing 5% milk and probed with anti-CherryFP (upper blot) 
and anti-FMDV2A (lower blot) antibodies overnight at 4°C Detection of bound primary antibody was achieved by using respective secondary 
antibodies, followed by ECL detection. All experiments were done in triplicates. 



constructs showed a high level of internal initiation, and 
produced minor translation products which corres- 
ponded in size to products from internal initiation 
at one of three start codons within the N-terminal 
CherryFP sequence (MetlO, Metl7 and Met23, with 
MetlO and Met23 being in more favourable Kozak con- 
texts (AaCAUGG, ttCAUGc and caCAUGG, respect- 
ively; upper-case letters indicate highly conserved bases, 
and lower-case letters - bases that can vary). It should 
be noted that internal initiation at MetlO produces the 
same decrease in molecular mass as that observed when 
the length of 2A was designed in lOaa increments. 

These observations in vitro were confirmed by our 
in vivo experiments. Cells were transfected with the panel 
of pCherryFP-F2A-CherryFP constructs and cleavage activ- 
ities determined by western blotting (Figure 3c). As with 
the pGFP-F2A-CherryFP constructs, uncleaved [CherryFP- 
F2A 58> 50> 4 0 -CherryFP] polyproteins were barely detectable 
by the anti-CherryFP antibodies. The major products in 
these cases were the [CherryFP-F2A] and CherryFP cleav- 
age products. Again, [CherryFP-F2A] migrated as a doub- 
let, with the fastest migrating product corresponding to the 
CherryFP cleavage product - of uniform size across the gel 
tracks (Figure 3c, upper blot). Although easily detectable 
for the 30aa version, the uncleaved form accumulated to 
a somewhat lesser degree than the two major cleavage 



products. As we observed for the in vitro translations, the 
upstream cleavage products (now [CherryFP-F2A]) were 
detected by an anti-FMDV2A antibody (Figure 3c, lower 
blot), but the uncleaved [CherryFP-F2A-CherryFP] was 
not. Using this antibody, two non-specific proteins were 
detected both in transfected and mock-transfected cells 
(Figure 3c, lower blot, highlighted with asterisks). Interest- 
ingly, the amounts of the upstream and downstream cleav- 
age products detected by both anti-CherryFP and anti- 
FMDV2A antibodies in HeLa cells transfected with 
pCherryFP-F2A 58 -CherryFP construct were lower than 
those expressed from constructs with shorter 2As. 

Similar to our data obtained for pGFP-F2A 2 o-CherryFP 
and pGFP-F2A 18 -CherryFP, both in reticulocyte lysates 
(Figure 3b) and in transfected HeLa cells (Figure 3c), the 
uncleaved polyproteins [CherryFP-F2A 20 -CherryFP] and 
[CherryFP-F2A 18 -CherryFP] represented the major trans- 
lation products. The cleavage products upstream of F2A 
([CherryFP-F2A 20 ] and [CherryFP-F2A 18 ]) migrated as 
doublets, as discussed above. The cleavage product down- 
stream of F2A, CherryFP, was of uniform size across all 
the gel tracks. Interestingly, it appeared that cleavage was 
more efficient when CherryFP was upstream of 2A, rather 
than GFP - shown using both anti-CherryFP (Figure 3c, 
upper blot) and anti-FMDV2A antibodies (Figure 3c, 
lower blot). We assume the higher amounts of cleaved 
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products arose from the changes in the C-terminal se- 
quences of the upstream gene (CherryFP versus GFP), as 
the linker sequences connecting them to F2A were identi- 
cal in all constructs (-RAKRSLE-). In addition, similar to 
the data presented for pGFP-F2A-CherryFP constructs, 
two extra bands were observed in both reticulocyte lysates 
and HeLa cells transfected with constructs containing 
F2As of 20 and 18 amino acids. The slow migrating 
band corresponded to protein produced by internal ini- 
tiation within the upstream Cherry gene (probably Met71; 
Figure 3b, highlighted with asterisk). This product was 
also detected in HeLa cells using anti-CherryFP anti- 
bodies (Figure 3c, upper blot, highlighted with asterisk). 
The extra doublet (-45 kDa) we observed using reticu- 
locyte lysates (Figure 3b, shown with white arrow) was 
also detected in HeLa cell extracts using anti-CherryFP 
antibodies (Figure 3c, upper blot, shown with white arrow), 
but not anti-FMDV 2A antibodies. 

Co-expression of CherryFP and GFP fluorescent proteins 
from pCherryFP-F2A-GFP constructs 

The downstream CherryFP sequences were replaced 
with GFP to create pCherryFP-F2A-GFP, again linked 
with the different lengths of F2A (Figure 4a). The pat- 
tern of accumulation of uncleaved polyproteins was as 



observed before: an increase as the length of F2A 
decreases. The uncleaved polyprotein formed the 
major product in the case of pCherryFP-F2A 2 o-GFP 
and pCherryFP-F2A 18 -GFP both in reticulocyte lysates 
(Figure 4b) and in HeLa cells (Figure 4c). The cleaved up- 
stream proteins, [CherryFP- F2 A] s, expressed from all six 
constructs, migrated as a doublet in the case of transla- 
tion in vitro (Figure 4b) and in vivo - in the latter case 
detected by western blotting using anti-CherryFP anti- 
bodies (Figure 4c, lower blot). The second major band 
corresponded to the downstream cleavage product, GFP 
(Figure 4b and 4c, upper blot). Translation profiles derived 
from pCherryFP-F2A-GFP constructs with F2As 58 _ 30 
showed a certain level of internal initiation (probably from 
Met71) within the upstream CherryFP gene. This pro- 
duced fast migrating bands corresponding to cleaved 
products of decreasing molecular weights (Figure 4b, in- 
dicated as 'CherryFP internal initiation ). 

The F2A 2 o and F2A 18 versions showed poor cleavage: 
the GFP cleavage product was almost undetectable in 
western blots using the anti-GFP antibody (Figure 4c, 
upper blot). Internal initiation was observed in the reticu- 
locyte lysate translation in vitro. The faster migrating 
bands corresponded to N-terminally truncated [CherryFP- 
F2A] proteins (probably produced by initiation at Met71; 
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Figure 4 Efficiency of F2A cleavage in CherryFP-F2A-GFP context. F2A sequences of various lengths were used to co-express CherryFP and 
GFP proteins from pCherryFP-F2A-GFP constructs (a, schematic presentation) in vitro using coupled transcription/ translation rabbit reticulocyte 
lysates (b) and transfected HeLa cells (c). For TnT, reticulocyte lysates were programmed with 20 ng of plasmid DNA and translation products 
were resolved by the 12% SDS-PAGE. For in vivo studies, HeLa cells were transfected with 1.5 ug of plasmid DNA and harvested 30 h post 
transfection. Cells were lysed in RIPA buffer and equal amounts of total protein for each transfection were loaded onto 12% SDS-PAGE gel. The 
proteins were transferred onto a nitrocellulose membrane, blocked in PBST containing 5% milk and probed with anti-GFP (upper blot) and anti- 
CherryFP (lower blot) antibodies overnight at 4°C. Detection of bound primary antibody was achieved by using respective secondary antibodies, 
followed by ECL detection. All experiments were done in triplicates. 
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Figure 4b, indicated as 'CherryFP internal initiation'). Initi- 
ation at this site also accounts for products migrating 
somewhat more rapidly than the full-length (uncleaved) 
polyprotein (Figure 4, b and c, highlighted with asterisks). 
This extra product was observed in both reticulocyte ly- 
sates programmed with the plasmids encoding F2As of 20 
and 18aa, and in extracts of HeLa cell transfected with 
these constructs. In these cases, the poor cleavage activity 
leads to an accumulation of uncleaved products and 
enabled sufficient material to accumulate to be observed 
directly (translation in vitro), or, be recognised by both 
anti-CherryFP and anti-GFP antibodies. 

Tine-tuning 7 of shorter F2A sequences 

To assess the utility of shorter F2A sequences for co- 
expression, we produced a series of constructs in which 
sequences encoding GFP and CherryFP reporter' pro- 
teins were linked by the panel of N-terminally single- 
residue truncated versions of F2A (F2As 2 5-2iJ Figure 1). 
These constructs were analysed together with the previ- 
ously described pGFP-F2A-CherryFP constructs contai- 
ning F2A 58 , F2A 30 , F2A 2 o and F2A 18 . Based upon the 
data shown in Figure 2, pGFP-F2A 58 -CherryFP / pGFP- 
F2A 30 -CherryFP were used as exemplars of highly effi- 
cient 2A cleavage and pGFP-F2A 2 o-CherryFP / pGFP- 
F2A 18 -CherryFP as exemplars of inefficient cleavage. 
Analysis of the translation products showed three major 
bands which corresponded to the three predicted translation 



products and their molecular weights: uncleaved [GFP- 
F2A-CherryFP] polyprotein and the two major cleavage 
products, [GFP-F2A] and CherryFP (Figure 5b). While 
the intensities of the latter bands were much higher in 
reticulocyte lysates programmed with constructs con- 
taining F2As of 30, 25, 24 and 23aa (in comparison with 
F2As of 22-18aa), the translation products derived from 
the four constructs encoding shorter 2A sequences were 
mostly uncleaved. Translation profiles derived from all 
constructs showed a low level of internal initiation 
(probably from Met88) within the upstream GFP gene. 
The fast migrating band corresponded to N-terminally 
truncated cleaved products [GFP-F2A] and slow mig- 
rating band corresponded to N-terminally truncated 
uncleaved polyprotein (Figure 5b, highlighted with 
asterisk). Again, the additional translation product of ~ 
45KDa was observed (more prominent with F2A ver- 
sions of 25aa and shorter), but was absent in constructs 
with F2A 30 and F2A 58 . This additional translation prod- 
uct was detected by anti-GFP (Figure 5c, upper blot, 
shown with white arrow), but not anti-CherryFP anti- 
bodies (Figure 5c, lower blot). From F2A 25 , the intensity 
of this band increased as the length of F2A decreased. 
In the cases of F2A of 23aa and longer, the translation 
products upstream of F2A ([GFP-F2A]) and down- 
stream ([CherryFP]) were easily detected by anti-GFP 
(Figure 5c, upper blot) and anti-CherryFP antibodies 
(Figure 5c, lower blot), respectively. In the case of 
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Figure 5 'Fine-tuning' of F2A cleavage in GFP-F2A-CherryFP context. F2A sequences of various lengths were used to co-express GFP and 
CherryFP proteins from pGFP-F2A-CherryFP constructs (a, schematic presentation) in vitro using coupled transcription/ translation rabbit 
reticulocyte lysates (b) and transfected HeLa cells (c). For TnT, reticulocyte lysates were programmed with 20 ng of plasmid DNA and translation 
products were resolved by the 12% SDS-PAGE. For in vivo studies, HeLa cells were transfected with 1.5 ug of plasmid DNA and harvested 30 h 
post transfection. Cells were lysed in RIPA buffer and equal amounts of total protein for each transfection were loaded onto 12% SDS-PAGE gel. 
The proteins were transferred onto a nitrocellulose membrane, blocked in PBST containing 5% milk and probed with anti-GFP (upper blot) and 
anti-CherryFP (lower blot) antibodies overnight at 4°C. Detection of bound primary antibody was achieved by using respective secondary 
antibodies, followed by ECL detection. All experiments were done in triplicates. 
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F2As 22 _ 20 , cleavage products were barely detectable, 
whilst for F2A 18 no cleavage products could be detected. 

Discussion 

Initial characterization of the FMDV 2A region by Ryan et 
al [10] showed that while the FMDV 2A/2B cleavage ac- 
tivity was mediated by residues located within the 19 
amino acid version of F2A, N-terminal truncation of F2A 
reduced cleavage efficiency. Ryan and Drew [14] further 
described the construction of plasmid (pCAT2AGUS) in 
which a 19 amino acid version of F2A was inserted be- 
tween the reporter genes chloramphenicol acetyl trans- 
ferase (CAT) and p-glucuronidase (GUS). Analyses of 
in vitro translation products derived from a series of 
constructs in which sequences were progressively de- 
leted from the N-terminal region of F2A showed that 
activity was ablated in versions with less than 13 resi- 
dues (-LKLAGDVESNPGP-). It was concluded that 
residues comprising the N-terminal portion of the F2A 
region whilst being influential were not critical for 
cleavage activity [14]. 

According to the model for 2A-mediated cleavage, resi- 
dues that may influence the activity are predicted to reside 
within the exit tunnel of the ribosome. The use of longer 
versions of F2A with N-terminal extensions derived from 
capsid protein ID (to a total of ~30aa) were reported to 
produce higher levels of cleavage [10-13]. Specifically, in a 
series of constructs based upon pGFP2AGUS, N-terminal 
extension of 2A by 5aa of ID (equivalent to F2A 25 ) im- 
proved cleavage, but extension by 14, 21 and 39aa (equiva- 
lent of F2A of 35, 40 and 58aa) produced complete 
cleavage and an equal stoichiometry of both translation 
products in vitro [11]. 

These observations are consistent with our in vitro and 
in vivo data on cleavage efficiency in constructs expressing 
GFP and CherryFP fluorescent reporter' proteins sepa- 
rated by F2As of 58, 50, 40 and 30aa. For all constructs 
tested, the in vitro analyses showed the polyprotein was ef- 
ficiently cleaved, the upstream protein bearing the F2A 
'tail' at its C-terminus. In transfected HeLa cells, however, 
small amounts of uncleaved polyprotein were detected by 
western blotting (using anti-GFP / anti-CherryFP, but not 
anti-FMDV2A, antibodies) in constructs encoding the 
F2A 30 version. Swapping the gene order had no effect on 
cleavage efficiency when longer versions of 2As were used, 
showing that for these particular lengths, cleavage is not 
sensitive to the C- terminal sequence of the upstream pro- 
tein - since this has already transited through the exit tun- 
nel of the ribosome. 

Further experiments demonstrated that the cleavage 
efficiency was gradually reduced with each N-terminal 
amino acid deletion, shown by both our in vitro and 
in vivo data when pGFP-F2A-CherryFP constructs with 
F2As of 25aa to 18aa were used. The length of F2A of 



23aa was demonstrated to be the most efficient of the 
shorter versions of 2A, and deletion of just one amino 
acid from its N-terminus resulted in a substantial de- 
crease in cleavage efficiency, whereas truncation to 
F2A 18 essentially ablated cleavage: data apparently at 
variance with those previously reported for F2A 20 , where 
cleavage was efficient [10, 11; discussed below]. Cleavage 
efficiency was improved somewhat in constructs where 
the upstream gene was CherryFP (rather than GFP). 

The exit tunnel of the ribosome accommodates 30-40 
amino acids. For FMDV, the C-terminal region of pro- 
tein ID (immediately upstream of 2 A) must have evo- 
lved to interact with the exit tunnel such that F2A (18aa 
long) mediates cleavage highly efficiently. In the case of 
artificial polyproteins, the authentic' upstream context 
of 2A is replaced with foreign gene or linker sequences 
which may create an unfavourable environment for ribo- 
some skipping': their presence inside the exit tunnel 
may adversely affect the interaction of F2A with the 
ribosome. Low cleavage efficiency may arise, therefore, 
due to (i) the nature of the C-terminal region of the up- 
stream gene, or, (ii) the manner in which it is linked to 
the shorter versions of F2A. 

The data presented here is in agreement with many pre- 
viously published studies that successfully used F2A and 
'2A-like' sequences in a wide range of heterologous sys- 
tems, notably, adoptive cell cancer gene therapies, genetic 
engineering of human stem cells and the co-expression of 
transcription factors for the induction of pluripotent stem 
cells [11,19,20,22-25,29-35]. Our findings also help explain 
previously published studies in which incomplete cleavage 
was observed. To avoid using longer 2A sequences (which 
remain attached at the C-terminus of the upstream pro- 
tein), researchers used shorter versions of F2A. Conflicting 
cleavage efficiencies have been reported for 2As of the 
same lengths [11,14,19,22,25,27]. For example, Furler et al 
[18] showed that within the context of a polio virus vector, 
2A-mediated cleavage was incomplete when F2A 24 was 
used. Fang et al [19] described similar results for the con- 
stitutive expression of full-length two-chain antibodies 
from a single ORF by linking the heavy and light antibody 
chains with F2A 24 . Different cleavage efficiencies depend- 
ing on protein arrangement were observed when GFP, 
EGFP and hrGFP were co-expressed with cytochrome 
P450 2B1 (CYP2B1) via F2A 20 [36]. In addition, Donnelly 
and co-workers, using F2A 20 in GFP2AGUS and GUS2- 
AGFP, also reported the presence of small amounts of 
uncleaved polyprotein [11]. With F2A 19 , cleavage was 
reported to be less efficient (70%) in [CFP-2A-PAC] con- 
text, with some uncleaved polyprotein still present [37], 
while de Felipe and co-workers [17] used an F2A 18 version 
in the generation of retroviral vectors and also observed 
uncleaved polyprotein. The cleavage was, however, repor- 
ted to be complete when the sequence was extended to 
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58aa [17]. Finally, Ma and Mitra used F2A 17 to co-express 
GUS and CAT proteins in different arrangements and 
demonstrated that [GUS-2A-CAT] and [GUS-2A-GUS] 
were cleaved to completion, while [CAT-2A-GUS] was 
not [38]. 

Interestingly, we show here that pGFP-F2A 20 -CherryFP 
and pGFP-F2A 18 -CherryFP produced poor cleavage both 
in vitro and in vivo, apparently at variance with our own 
previously published data [10,11]. In these studies, the 
pGFP2AGUS construct contained the same upstream 
GFP gene and the same F2A 2 o sequence and produced ef- 
ficient cleavage. Sequence comparison of pGFP-F2A 20 - 
CherryFP and pGFP2AGUS constructs shows that the dif- 
ferences are only observed in the sequence immediately 
upstream of 2A (RAKRSLE vs SGSRGAC, respectively). 
Ironically, this seven amino acid stretch is a part of neither 
F2A nor C terminus of the upstream GFP, but is actually 
composed of two different restriction sites introduced for 
cloning purposes during the construction of both plas- 
mids. The effect of the differences in this particular region 
on the efficiency of 2A-mediated cleavage is the subject of 
a separate report [39]. 

Swapping the order of fluorescent reporter' proteins 
in our expression systems had no effect on the general 
pattern of cleavage efficiency. However, while two genes 
could be effectively co-expressed (irrespective of pos- 
ition) when linked by a longer version of F2A (30aa and 
longer), we did observe a small effect for the shorter 
F2A 2 o and F2A 18 versions. 

Conclusions 

In summary, F2As are used widely in biotechnology and 
biomedicine and we wished to identify the source(s) of ap- 
parent discrepancies in the literature with regards to cleav- 
age efficiency. While constructs containing F2A of 58- 
23aa in length exhibited enhanced abilities for cleavage 
compared to the constructs with F2A of 22aa and shorter, 
with least efficient 2 As being those of 20aa and 18aa, we 
suggest that researchers use F2A 30 if opting for a shorter 
sequence. This 2A proved to be most optimal in terms of 
both length and cleavage efficiency, and, in our hands, was 
unaffected by the sequence of the upstream gene. A more 
extensive characterization of sequence requirements be- 
tween the C-terminal region of the upstream protein and 
the N-terminal part of F2A is underway to test the specu- 
lation that this particularly interesting region of the se- 
quence may have certain effect on 2A-mediated cleavage. 
Ultimately, the fine-tuning of the F2A sequence can help 
researchers in their efforts to efficiently co-express various 
multiple proteins for successful biomedical strategies. 

Methods 

The following reagents were obtained from commercial 
suppliers: Pfu polymerase, T4 DNA ligase, restriction 



enzymes, JM109 competent cells, Quick TnT rabbit re- 
ticulocyte lysate kit (Promega, Madison, WI, USA); S 35 
methionine (MP Biomedicals, Santa Ana, CA, USA); 
EDTA-free protease inhibitor cocktail ( Sigma- Aldrich 
Company Ltd, Dorset, UK); pcDNA3.1, anti-RFP, rabbit 
purified polyclonal antibody against Red Fluorescent 
Protein (1:1000 dilution), anti-GFP, mAb 3E6, anti- 
Green Fluorescent Protein, mouse monoclonal antibody 
(1:1000 dilution), anti-p-tubulin, mouse monoclonal 
antibody (1:2000 dilution), Lipofectamine2000 transfec- 
tion reagent, iBlot Gel Transfer Stacks, nitrocellulose 
(Life Technologies Ltd, Paisley, UK); secondary HRP- 
conjugated polyclonal goat anti-mouse and anti-rabbit 
antibodies (Dako UK Ltd, Ely, UK); rabbit anti-FMDV 
2A antiserum raised against the synthetic peptide NH2- 
LLNFDLLKLAGDVESNPGP-COOH; EZ-ECL Chemilu- 
minescence Detection Kit for HRP (Geneflow Ltd, 
Elmhurst, UK); VECTASHIELD mounting medium with 
DAPI (VECTOR Laboratories Ltd, Peterborough, UK); 
Oligonucleotides were purchased from Integrated DNA 
Technologies (Coralville, Iowa, USA). 

Construction of plasmids 

pGFP-F2As-CherryFP constructs (F2As of 58, 50, 40, 30, 
25-20, and 18aa, sequences shown in Figure 1). To make 
pGFP-F2A 58 -CherryFP construct, F2A 58 sequence, obtained 
by digesting pSV7 plasmid (kindly provided by Sandra 
Varte) with Nsil and Apal restriction enzymes, was cloned 
into pJC3 plasmid (pcDNA3.1 -based with T2A linking GFP 
and CherryFP genes) in place of T2A sequence using these 
restriction sites. pGFP-F2A 58 -CherryFP construct was then 
used as a template to produce deletions of F2A. Forward 
oligonucleotides incorporating Xhol restriction site: (5'- 
CTCGAGAAGAGGGCCGAAACATACTGTCCAAGG-3' 
for F2A 50 , 5 -CTCGAGTTGCTGGCAATCCACCCAAC 
TGAAGCC-3 ' for F2A 40 , 5 -GCGCTCGAGCACAAAC 
AGAAAATTGTGGCACCGGTG-3' for F2A 30 , 5-GCG 
CTCGAGGTGGCACCGGTGAAACAGACTTTG-3 ' for 
F2A 25 , 5 ' -GCGCTCG AGGCACCGGTGAAACAG ACTTT 
G-3' for F2A 24 , 5' -GCGCTCG AGCCGGTGAAACAGAC 
TTTG AATTTTG-3 ' for F2A 23 , 5 -GCGCTCGAGGTGA 
AACAGACTTTGAATTTTGAC-3 ' for F2A 22 , 5-GCGC 
TCGAGAAACAGACTTTGAATTTTGACCTTC-3 ' for 
F2A 21 , 5 -GCGCTCG AGC AG ACTTTGAATTTTGACC 
TTCTCAAG-3' for F2A 20 , 5 -GCGCTCGAGTTGAATT 
TTGACCTTCTCAAGTTGGCG-3 ' for F2A 18 ) and a re- 
verse oligonucleotide incorporating NotI restriction site (5'- 
TAGAAGGCACAGTCGAGGC-3 ') were used to amplify 
F2As-CherryFP with 2As of various lengths. The PCR pro- 
ducts were digested with Xhol and NotI restriction enzymes 
and cloned into pGFP-F2A 58 -CherryFP in place of F2A 58 - 
CherryFP sequence using Xhol and NotI restriction sites. 

pCherryFP-F2As-CherryFP constructs (F2As of 58, 50, 
40, 30, 20, and 18aa). Respective pGFP-F2As-CherryFP 
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constructs were digested with BamHI and Nsil restriction 
enzymes to substitute GFP sequence with that of 
CherryFP. In order to do that, pJC3 plasmid was used as a 
template to amplify CherryFP gene using forward (5'- 
GCGGGATCCATGGTGAGCAAGGGCGAGGAGGATA 
AC -3') and reverse (5 -GCGATGCATTTTGTACAAT 
TCATCCATGCCGCCG -3') primers incorporating, re- 
spectively, BamHI and Nsil restriction sites. PCR products 
were digested with BamHI and Nsil restriction enzymes 
and ligated into prepared vectors. 

pCherryFP-F2As-GFP constructs (F2As of 58, 50, 40, 
30, 20 and 18aa). Respective pCherryFP-F2 As -CherryFP 
constructs were digested with Apal and EcoRI restric- 
tion enzymes to substitute downstream CherryFP gene 
with that of GFP. pJC3 plasmid was used as a template 
to amplify GFP gene using forward (5'-GCGGGGCCC 
GATATCGTGTCCAAAGGGGAAGAG-3 ') and reverse 
(5-GCGGAATTCTTACTTATACAGCTCGTCCATGC 
CGAG-3') primers incorporating Apal and EcoRI re- 
striction sites. PCR products were digested with Apal 
and EcoRI restriction enzymes and ligated into pre- 
pared vectors. 

Transcription and translation 

Plasmid constructs were used to programme Quick TnT 
Coupled Transcription/ Translation reticulocyte lysate 
systems (Promega) and the reactions repeated three times. 
Translation reactions (10 \A) were performed according to 
the manufacturers instructions. Briefly, 20 ng of plasmid 
DNA was used to programme rabbit reticulocyte lysates 
containing 35 S methionine (10 \iCi) and the reactions were 
incubated at 30°C for 1 h. Translation reactions were 
stopped by the addition of 2x protein dissociation buffer. 
Protein products were analysed by 12% SDS-PAGE. Con- 
trols of TnT reactions programmed with 'empty' vectors 
produced no translation products (data not shown). 

Expression in HeLa cells 

HeLa cells were maintained in DMEM medium supple- 
mented with 10% FCS. For western blotting, the cells were 
transfected in 60 mm dishes (pre-plated 20 h earlier to 
60% confluency) with 1.5 \ig of plasmid DNA and 7 \A of 
Lipofectamine2000 (Invitrogen) in a final volume of 
400 \A OptiMEM. Transfection mix was added to 4 ml of 
antibiotic-free serum-containing medium and cells were 
incubated for a total of 30 h post-transfection. All trans- 
fections were performed in triplicate. Controls with 
empty' expression vector transfected cells produced no 
translation products (western blotting) or fluorescence 
(data not shown). 

Western blotting 

30 h post-transfection, cells were washed twice with 1 ml 
of PBS and harvested in 1 ml of PBS by centrifugation at 



2000 rpm for 5 min. Whole-cell lysates were prepared in 
70 \A of radioimmuno precipitation assay buffer, RIPA 
(150 mM NaCl, 10 mM Tris pH 7.4, 1% Triton X-100, 1% 
Na deoxycholate, 0.1% SDS, and freshly added 1/20 vo- 
lume of EDTA-free protease inhibitor cocktail) by incuba- 
tion on ice for 30 min and centrifugation at 12000 rpm 
for 20 min at 4°C. Cellular debris was discarded and 
supernatants containing proteins were analysed by 12% 
SDS-PAGE. The proteins were then transferred to a 
nitrocellulose membrane, which was probed with anti- 
GFP, anti-CherryFP or anti-FMDV 2A primary antibodies 
overnight after blocking in PBST containing 5% milk for 
1 h. Following overnight incubation, the membranes were 
washed three times in PBST. Detection of bound primary 
antibodies was performed with respective HRP-secondary 
antibodies (Dako) in PBST containing 5% milk for 1 h, the 
membranes were washed three times with PBST, rinsed in 
deionised water and subjected to enhanced chemilumi- 
nescence by incubating it in freshly-prepared visualisation 
solution for 2 min. Membrane was exposed to an auto- 
radiograph film for 3 to 45 seconds. 

Fluorescence microscopy 

Transfection of HeLa cells was performed as described 
above for western blotting, however, 0.5 (ig of plasmid 
DNA and 5 ul of Lipofectamine2000 in a final volume of 
200 \A OptiMEM were added to 2 ml of media in each 
well of a 6-well plate containing coverslips. 30 h post- 
transfection the cells were washed twice with 1 ml of PBS, 
fixed with 1 ml of ice-cold 100% methanol for 10 min and 
washed again with 1 ml of PBS. The coverslips were moun- 
ted in VECTASHIELD (an antifade mounting medium, 
VECTOR Laboratories). The images were acquired with 
Deltavision microscope using 100 x objective using Resolve 
3D software. Experiments were done in triplicates. 
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